Naval Research Laboratory, Code 5514 Navy Center for Applied Research in Artificial Intelligence 4555 Overlook Ave., S.W., Washington, D.C. 20375-5320
نویسندگان
چکیده
Reinforcement Learning Methods (RLMs) typically select candidate solutions stochastically based on a credibility space of hypotheses which the RLM maintains, either implicitly or explicitly. RLMs typically have both inductive and deductive aspects: they inductively improve their credibility space on a stage-by stage basis; they deductively select an appropriate response to incoming stimuli using their credibility space. In this sense, RLMs share some learning attributes in common with active, incremental concept learners. Unlike some concept learners that employ deterministic procedures for selecting hypotheses, however, the evaluations of hypotheses provided to RLMs are often uncertain, either due to noisy environments, or due to summary evaluations which occur after a sequence of learner-environment interactions. This paper examines issues of inductive learning bias in this context experimentally. Specifically, the paper addresses inductive learning biases in the context of a simple RLM called a Collective Learning Automaton (CLA). The CLA learns the shortest path through a small network. The research points out some of the difficulties of finding performance measures that indicate the strongest, correct biases for the automaton.
منابع مشابه
Distributed Mobile Sensor Networks for Hazardous Applications
1Research Department for Underwater Acoustics and Marine Geophysics, Bundeswehr Technical Centre for Ships and Naval Weapons, Naval Technology and Research (WTD 71), Klausdorfer Weg 2, 24148 Kiel, Germany 2Navy Center for Applied Research in Artificial Intelligence, Naval Research Laboratory, 4555 Overlook Avenue S.W., Washington, DC 20375, USA 3Acoustic Research Laboratory, National University...
متن کاملStacking nematic elastomers for artificial muscle applications
Stacking nematic elastomers for artificial muscle applications Christopher M. Spillmann a,∗, Jawad Naciri a, Brett D. Martin a, Waleed Farahat b, Hugh Herr b, Banahalli R. Ratna a a Naval Research Laboratory, Center for Bio/Molecular Science and Engineering, 4555 Overlook Ave SW, Washington, DC 20375, United States b Massachusetts Institute of Technology, Biomechatronics Group, 20 Ames Street, ...
متن کاملACT-R Workshop
Dario D. Salvucci ([email protected]), Michael D. Byrne ([email protected]), Christian Lebiere ([email protected]), Niels A. Taatgen ([email protected]) J. Gregory Trafton ([email protected]) 1 College of Computing & Informatics, Drexel University, 3141 Chestnut St., Philadelphia, PA 19104, USA 2 Department of Psychology, Rice University, 6100 Main St., Houston, TX 77005, USA 3 Department of Psyc...
متن کاملNo anticorrelation between cyclotron line energy and X-ray flux in 4U 0115+634
1 Dr. Karl Remeis-Observatory & ECAP, Universität Erlangen-Nürnberg, Sternwartstr. 7, 96049 Bamberg, Germany email: [email protected] 2 ISDC Data Center for Astrophysics, University of Geneva, 16 chemin d’Écogia, 1290 Versoix, Switzerland 3 Leibniz-Institut für Astrophysik Potsdam, An der Sternwarte 16, 14482 Potsdam, Germany 4 School of Physics, Astronomy, and Comput...
متن کاملEUV emission lines and diagnostics observed with Hinode / EIS
P. R. Young, G. Del Zanna, H. E. Mason, K. P. Dere, E. Landi, M. Landini, G. A. Doschek, 1STFC Rutherford Appleton Laboratory, Chilton, Didcot, Oxfor 2University College London, Department of Space and Climate Physics, 3DAMTP, Centre for Mathematical Sciences, 4George Mason University, 4400 University Dr., Fairfax 5Artep, Inc at Naval Research Laboratory, 4555 Overlook Ave. S.W., 6Code 7670, Na...
متن کامل